Overview
Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 33 |
| Duplicate rows (%) | 3.3% |
| Total size in memory | 390.1 KiB |
| Average record size in memory | 399.5 B |
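The percentages and the average record size in this table follow directly from the raw counts. A minimal sketch of that arithmetic, using only the figures reported above (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 1000
n_duplicate_rows = 33
total_size_kib = 390.1

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = 100 * n_duplicate_rows / n_rows

# Average record size: total bytes (1 KiB = 1024 B) divided by the row count.
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results agree with the reported 3.3% and 399.5 B after rounding to one decimal place.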
Variable types
| Text | 1 |
|---|---|
| Numeric | 7 |
| Categorical | 4 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 33 (3.3%) duplicate rows | Duplicates |
| engine is highly overall correlated with max_power and 2 other fields | High correlation |
| km_driven is highly overall correlated with year | High correlation |
| max_power is highly overall correlated with engine and 2 other fields | High correlation |
| seats is highly overall correlated with engine | High correlation |
| selling_price is highly overall correlated with engine and 3 other fields | High correlation |
| transmission is highly overall correlated with max_power and 1 other field | High correlation |
| year is highly overall correlated with km_driven and 1 other field | High correlation |
| seller_type is highly imbalanced (52.7%) | Imbalance |
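The 52.7% imbalance figure for seller_type can be reproduced from the category counts listed later in the report (837, 135, 28). The sketch below assumes the score is one minus the normalised Shannon entropy; that definition is an assumption which happens to match the reported value, and the function name is hypothetical.

```python
import math

def imbalance_score(counts):
    """1 - H / log2(k): 0 for a uniform split, approaching 1 as one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # not meaningful for fewer than two classes; this sketch returns 0
    # Shannon entropy in bits of the observed class proportions.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts for Individual, Dealer, Trustmark Dealer from the seller_type section.
seller_type_counts = [837, 135, 28]
score = imbalance_score(seller_type_counts)
print(f"{score:.1%}")
```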
Reproduction
| Analysis started | 2024-11-27 19:15:04.813060 |
|---|---|
| Analysis finished | 2024-11-27 19:15:24.361511 |
| Duration | 19.55 seconds |
| Software version | ydata-profiling v4.12.0 |
| Download configuration | config.json |
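The reported duration is simply the difference between the two timestamps. A small sketch with the standard library, included only as a consistency check on the table above:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2024-11-27 19:15:04.813060")
finished = datetime.fromisoformat("2024-11-27 19:15:24.361511")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```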
Variables
name
Text
| Distinct | 621 |
|---|---|
| Distinct (%) | 62.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.1 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 39 |
| Mean length | 24.857 |
| Min length | 11 |
Unique
| Unique | 440 |
|---|---|
| Unique (%) | 44.0% |
Sample
| 1st row | Mahindra Xylo E4 BS IV |
|---|---|
| 2nd row | Tata Nexon 1.5 Revotorq XE |
| 3rd row | Honda Civic 1.8 S AT |
| 4th row | Honda City i DTEC VX |
| 5th row | Tata Indica Vista Aura 1.2 Safire BSIV |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| maruti | 290 | 6.2% |
| hyundai | 198 | 4.2% |
| tata | 106 | 2.3% |
| mahindra | 90 | 1.9% |
| swift | 83 | 1.8% |
| diesel | 83 | 1.8% |
| bsiv | 79 | 1.7% |
| vxi | 74 | 1.6% |
| 1.2 | 71 | 1.5% |
| plus | 64 | 1.4% |
| Other values (495) | 3549 | 75.7% |
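The word frequencies above look like counts of lowercased, whitespace-separated tokens. A minimal sketch of that kind of count, assuming this tokenisation (the report does not state it) and using only the five sample names shown earlier:

```python
from collections import Counter

# The five sample rows listed for the name variable.
names = [
    "Mahindra Xylo E4 BS IV",
    "Tata Nexon 1.5 Revotorq XE",
    "Honda Civic 1.8 S AT",
    "Honda City i DTEC VX",
    "Tata Indica Vista Aura 1.2 Safire BSIV",
]

# Lowercase each name, split on whitespace, and count every token.
word_counts = Counter(word for name in names for word in name.lower().split())
total_words = sum(word_counts.values())

for word, count in word_counts.most_common(3):
    print(f"{word}: {count} ({count / total_words:.1%})")
```

Note that the frequency column is relative to the total number of tokens, not the number of rows, which is why the percentages in the table are much smaller than count / 1000.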
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 3687 | 14.8% |
| a | 1852 | 7.5% |
| i | 1631 | 6.6% |
| t | 1253 | 5.0% |
| r | 1094 | 4.4% |
| o | 1010 | 4.1% |
| n | 934 | 3.8% |
| e | 890 | 3.6% |
| u | 738 | 3.0% |
| S | 701 | 2.8% |
| Other values (57) | 11067 | 44.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 24857 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 24857 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 24857 |
year
Real number (ℝ)
High correlation 
| Distinct | 24 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2013.681 |
| Minimum | 1995 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1995 |
|---|---|
| 5-th percentile | 2006 |
| Q1 | 2011 |
| median | 2014 |
| Q3 | 2017 |
| 95-th percentile | 2019 |
| Maximum | 2020 |
| Range | 25 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.0121486 |
|---|---|
| Coefficient of variation (CV) | 0.001992445 |
| Kurtosis | 1.2158841 |
| Mean | 2013.681 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -1.0223557 |
| Sum | 2013681 |
| Variance | 16.097336 |
| Monotonicity | Not monotonic |
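Several of the figures above are derived from one another: the range and IQR come from the quantiles, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. A short sketch that recomputes them from the reported values (variable names are illustrative):

```python
# Figures copied from the year tables.
minimum, maximum = 1995, 2020
q1, q3 = 2011, 2017
mean = 2013.681
std = 4.0121486

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # interquartile range
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance as the square of the standard deviation

print(value_range, iqr)
print(f"CV = {cv:.9f}, variance = {variance:.6f}")
```

The very small CV reflects that year values are large numbers clustered in a narrow band, not that the variable is uninformative.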
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2017 | 134 | 13.4% |
| 2016 | 106 | 10.6% |
| 2015 | 96 | 9.6% |
| 2018 | 91 | 9.1% |
| 2011 | 85 | 8.5% |
| 2012 | 83 | 8.3% |
| 2014 | 79 | 7.9% |
| 2013 | 76 | 7.6% |
| 2019 | 64 | 6.4% |
| 2010 | 49 | 4.9% |
| Other values (14) | 137 | 13.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1995 | 1 | 0.1% |
| 1998 | 1 | 0.1% |
| 1999 | 5 | 0.5% |
| 2000 | 1 | 0.1% |
| 2001 | 2 | 0.2% |
| 2002 | 4 | 0.4% |
| 2003 | 8 | 0.8% |
| 2004 | 10 | 1.0% |
| 2005 | 10 | 1.0% |
| 2006 | 20 | 2.0% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2020 | 4 | 0.4% |
| 2019 | 64 | 6.4% |
| 2018 | 91 | 9.1% |
| 2017 | 134 | 13.4% |
| 2016 | 106 | 10.6% |
| 2015 | 96 | 9.6% |
| 2014 | 79 | 7.9% |
| 2013 | 76 | 7.6% |
| 2012 | 83 | 8.3% |
| 2011 | 85 | 8.5% |
selling_price
Real number (ℝ)
High correlation 
| Distinct | 274 |
|---|---|
| Distinct (%) | 27.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 617901.04 |
| Minimum | 31000 |
| Maximum | 6000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 31000 |
|---|---|
| 5-th percentile | 100000 |
| Q1 | 250000 |
| median | 434999 |
| Q3 | 670000 |
| 95-th percentile | 1904049 |
| Maximum | 6000000 |
| Range | 5969000 |
| Interquartile range (IQR) | 420000 |
Descriptive statistics
| Standard deviation | 758553.86 |
|---|---|
| Coefficient of variation (CV) | 1.22763 |
| Kurtosis | 21.438457 |
| Mean | 617901.04 |
| Median Absolute Deviation (MAD) | 205000 |
| Skewness | 4.2148309 |
| Sum | 6.1790104 × 10⁸ |
| Variance | 5.7540396 × 10¹¹ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 300000 | 29 | 2.9% |
| 350000 | 28 | 2.8% |
| 600000 | 28 | 2.8% |
| 550000 | 25 | 2.5% |
| 650000 | 24 | 2.4% |
| 400000 | 24 | 2.4% |
| 250000 | 22 | 2.2% |
| 500000 | 22 | 2.2% |
| 750000 | 22 | 2.2% |
| 450000 | 16 | 1.6% |
| Other values (264) | 760 | 76.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 31000 | 1 | 0.1% |
| 33983 | 1 | 0.1% |
| 35000 | 1 | 0.1% |
| 40000 | 1 | 0.1% |
| 45000 | 5 | 0.5% |
| 46000 | 1 | 0.1% |
| 50000 | 2 | 0.2% |
| 52000 | 2 | 0.2% |
| 55000 | 3 | 0.3% |
| 55599 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6000000 | 2 | 0.2% |
| 5500000 | 5 | 0.5% |
| 5400000 | 2 | 0.2% |
| 5150000 | 3 | 0.3% |
| 4100000 | 1 | 0.1% |
| 3800000 | 2 | 0.2% |
| 3750000 | 1 | 0.1% |
| 3400000 | 1 | 0.1% |
| 3251000 | 1 | 0.1% |
| 3200000 | 8 | 0.8% |
km_driven
Real number (ℝ)
High correlation 
| Distinct | 260 |
|---|---|
| Distinct (%) | 26.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71393.341 |
| Minimum | 1303 |
| Maximum | 375000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1303 |
|---|---|
| 5-th percentile | 9190 |
| Q1 | 37000 |
| median | 61500 |
| Q3 | 100000 |
| 95-th percentile | 160000 |
| Maximum | 375000 |
| Range | 373697 |
| Interquartile range (IQR) | 63000 |
Descriptive statistics
| Standard deviation | 48486.219 |
|---|---|
| Coefficient of variation (CV) | 0.67914203 |
| Kurtosis | 3.8337561 |
| Mean | 71393.341 |
| Median Absolute Deviation (MAD) | 28500 |
| Skewness | 1.4228571 |
| Sum | 71393341 |
| Variance | 2.3509134 × 10⁹ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 120000 | 66 | 6.6% |
| 70000 | 58 | 5.8% |
| 60000 | 55 | 5.5% |
| 80000 | 54 | 5.4% |
| 40000 | 46 | 4.6% |
| 50000 | 44 | 4.4% |
| 90000 | 38 | 3.8% |
| 110000 | 35 | 3.5% |
| 100000 | 33 | 3.3% |
| 30000 | 27 | 2.7% |
| Other values (250) | 544 | 54.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1303 | 1 | 0.1% |
| 2000 | 7 | 0.7% |
| 2388 | 1 | 0.1% |
| 2600 | 1 | 0.1% |
| 3100 | 1 | 0.1% |
| 3500 | 2 | 0.2% |
| 3564 | 1 | 0.1% |
| 4000 | 1 | 0.1% |
| 4337 | 1 | 0.1% |
| 5000 | 9 | 0.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 375000 | 1 | 0.1% |
| 300000 | 2 | 0.2% |
| 298000 | 1 | 0.1% |
| 291000 | 1 | 0.1% |
| 270000 | 1 | 0.1% |
| 265000 | 1 | 0.1% |
| 264000 | 1 | 0.1% |
| 260000 | 1 | 0.1% |
| 250000 | 1 | 0.1% |
| 248000 | 1 | 0.1% |
fuel
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.6 KiB |
| Diesel | 534 |
|---|---|
| Petrol | 457 |
| CNG | 5 |
| LPG | 4 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.973 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Diesel |
|---|---|
| 2nd row | Diesel |
| 3rd row | Petrol |
| 4th row | Diesel |
| 5th row | Petrol |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Diesel | 534 | 53.4% |
| Petrol | 457 | 45.7% |
| CNG | 5 | 0.5% |
| LPG | 4 | 0.4% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| diesel | 534 | 53.4% |
| petrol | 457 | 45.7% |
| cng | 5 | 0.5% |
| lpg | 4 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 1525 | 25.5% |
| l | 991 | 16.6% |
| D | 534 | 8.9% |
| i | 534 | 8.9% |
| s | 534 | 8.9% |
| P | 461 | 7.7% |
| t | 457 | 7.7% |
| r | 457 | 7.7% |
| o | 457 | 7.7% |
| G | 9 | 0.2% |
| Other values (3) | 14 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5973 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5973 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5973 |
seller_type
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.2 KiB |
| Individual | 837 |
|---|---|
| Dealer | 135 |
| Trustmark Dealer | 28 |
Length
| Max length | 16 |
|---|---|
| Median length | 10 |
| Mean length | 9.628 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Individual |
|---|---|
| 2nd row | Individual |
| 3rd row | Individual |
| 4th row | Individual |
| 5th row | Individual |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Individual | 837 | 83.7% |
| Dealer | 135 | 13.5% |
| Trustmark Dealer | 28 | 2.8% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| individual | 837 | 81.4% |
| dealer | 163 | 15.9% |
| trustmark | 28 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| d | 1674 | 17.4% |
| i | 1674 | 17.4% |
| a | 1028 | 10.7% |
| l | 1000 | 10.4% |
| u | 865 | 9.0% |
| I | 837 | 8.7% |
| v | 837 | 8.7% |
| n | 837 | 8.7% |
| e | 326 | 3.4% |
| r | 219 | 2.3% |
| Other values (7) | 331 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9628 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9628 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9628 |
transmission
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 62.0 KiB |
| Manual | 877 |
|---|---|
| Automatic | 123 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.369 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Manual |
|---|---|
| 2nd row | Manual |
| 3rd row | Automatic |
| 4th row | Manual |
| 5th row | Manual |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Manual | 877 | 87.7% |
| Automatic | 123 | 12.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| manual | 877 | 87.7% |
| automatic | 123 | 12.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 1877 | 29.5% |
| u | 1000 | 15.7% |
| M | 877 | 13.8% |
| n | 877 | 13.8% |
| l | 877 | 13.8% |
| t | 246 | 3.9% |
| A | 123 | 1.9% |
| o | 123 | 1.9% |
| m | 123 | 1.9% |
| i | 123 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6369 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6369 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6369 |
owner
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.0 KiB |
| First Owner | 623 |
|---|---|
| Second Owner | 278 |
| Third Owner | 71 |
| Fourth & Above Owner | 27 |
| Test Drive Car | 1 |
Length
| Max length | 20 |
|---|---|
| Median length | 11 |
| Mean length | 11.524 |
| Min length | 11 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | First Owner |
|---|---|
| 2nd row | First Owner |
| 3rd row | First Owner |
| 4th row | First Owner |
| 5th row | Second Owner |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| First Owner | 623 | 62.3% |
| Second Owner | 278 | 27.8% |
| Third Owner | 71 | 7.1% |
| Fourth & Above Owner | 27 | 2.7% |
| Test Drive Car | 1 | 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| owner | 999 | 48.6% |
| first | 623 | 30.3% |
| second | 278 | 13.5% |
| third | 71 | 3.5% |
| fourth | 27 | 1.3% |
| & | 27 | 1.3% |
| above | 27 | 1.3% |
| test | 1 | < 0.1% |
| drive | 1 | < 0.1% |
| car | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| r | 1722 | 14.9% |
| e | 1306 | 11.3% |
| n | 1277 | 11.1% |
| (space) | 1055 | 9.2% |
| O | 999 | 8.7% |
| w | 999 | 8.7% |
| i | 695 | 6.0% |
| t | 651 | 5.6% |
| F | 650 | 5.6% |
| s | 624 | 5.4% |
| Other values (14) | 1546 | 13.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11524 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11524 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11524 |
mileage
Real number (ℝ)
| Distinct | 233 |
|---|---|
| Distinct (%) | 23.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.33748 |
| Minimum | 0 |
| Maximum | 32.26 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12.8 |
| Q1 | 16.55 |
| median | 19.3 |
| Q3 | 22.3 |
| 95-th percentile | 25.5 |
| Maximum | 32.26 |
| Range | 32.26 |
| Interquartile range (IQR) | 5.75 |
Descriptive statistics
| Standard deviation | 3.9517511 |
|---|---|
| Coefficient of variation (CV) | 0.20435709 |
| Kurtosis | 0.003254609 |
| Mean | 19.33748 |
| Median Absolute Deviation (MAD) | 2.8 |
| Skewness | -0.10970283 |
| Sum | 19337.48 |
| Variance | 15.616337 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 19.3 | 26 | 2.6% |
| 18.6 | 23 | 2.3% |
| 21.1 | 22 | 2.2% |
| 18.9 | 22 | 2.2% |
| 19.7 | 21 | 2.1% |
| 16.1 | 17 | 1.7% |
| 17 | 16 | 1.6% |
| 12.8 | 16 | 1.6% |
| 22.74 | 15 | 1.5% |
| 18.2 | 15 | 1.5% |
| Other values (223) | 807 | 80.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | 0.1% |
| 9.5 | 1 | 0.1% |
| 10.5 | 3 | 0.3% |
| 10.75 | 1 | 0.1% |
| 10.91 | 2 | 0.2% |
| 10.93 | 1 | 0.1% |
| 11 | 1 | 0.1% |
| 11.18 | 1 | 0.1% |
| 11.2 | 2 | 0.2% |
| 11.36 | 2 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 32.26 | 1 | 0.1% |
| 28.4 | 11 | 1.1% |
| 28.09 | 2 | 0.2% |
| 27.39 | 5 | 0.5% |
| 27.3 | 3 | 0.3% |
| 27.28 | 2 | 0.2% |
| 26.6 | 1 | 0.1% |
| 26.59 | 5 | 0.5% |
| 26.21 | 2 | 0.2% |
| 26 | 10 | 1.0% |
engine
Real number (ℝ)
High correlation 
| Distinct | 88 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1454.876 |
| Minimum | 624 |
| Maximum | 3604 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 624 |
|---|---|
| 5-th percentile | 796 |
| Q1 | 1197 |
| median | 1248 |
| Q3 | 1582 |
| 95-th percentile | 2523 |
| Maximum | 3604 |
| Range | 2980 |
| Interquartile range (IQR) | 385 |
Descriptive statistics
| Standard deviation | 521.99574 |
|---|---|
| Coefficient of variation (CV) | 0.35879054 |
| Kurtosis | 0.90097598 |
| Mean | 1454.876 |
| Median Absolute Deviation (MAD) | 248 |
| Skewness | 1.1890629 |
| Sum | 1454876 |
| Variance | 272479.55 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1248 | 135 | 13.5% |
| 1197 | 105 | 10.5% |
| 796 | 63 | 6.3% |
| 998 | 57 | 5.7% |
| 1396 | 51 | 5.1% |
| 2179 | 49 | 4.9% |
| 1498 | 47 | 4.7% |
| 2494 | 32 | 3.2% |
| 1199 | 31 | 3.1% |
| 1497 | 23 | 2.3% |
| Other values (78) | 407 | 40.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 624 | 7 | 0.7% |
| 796 | 63 | 6.3% |
| 799 | 11 | 1.1% |
| 814 | 18 | 1.8% |
| 909 | 1 | 0.1% |
| 936 | 5 | 0.5% |
| 993 | 2 | 0.2% |
| 995 | 2 | 0.2% |
| 998 | 57 | 5.7% |
| 999 | 7 | 0.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3604 | 1 | 0.1% |
| 3198 | 2 | 0.2% |
| 2993 | 3 | 0.3% |
| 2987 | 2 | 0.2% |
| 2982 | 5 | 0.5% |
| 2956 | 6 | 0.6% |
| 2953 | 1 | 0.1% |
| 2835 | 1 | 0.1% |
| 2755 | 5 | 0.5% |
| 2696 | 1 | 0.1% |
max_power
Real number (ℝ)
High correlation 
| Distinct | 180 |
|---|---|
| Distinct (%) | 18.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 90.84433 |
| Minimum | 34.2 |
| Maximum | 280 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 34.2 |
|---|---|
| 5-th percentile | 47.3 |
| Q1 | 69 |
| median | 82.425 |
| Q3 | 102 |
| 95-th percentile | 163.94 |
| Maximum | 280 |
| Range | 245.8 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 34.892709 |
|---|---|
| Coefficient of variation (CV) | 0.38409341 |
| Kurtosis | 3.7253344 |
| Mean | 90.84433 |
| Median Absolute Deviation (MAD) | 15.385 |
| Skewness | 1.5940154 |
| Sum | 90844.33 |
| Variance | 1217.5011 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 74 | 43 | 4.3% |
| 88.5 | 29 | 2.9% |
| 82 | 29 | 2.9% |
| 47.3 | 24 | 2.4% |
| 81.8 | 24 | 2.4% |
| 67.1 | 22 | 2.2% |
| 46.3 | 21 | 2.1% |
| 88.7 | 20 | 2.0% |
| 88.73 | 20 | 2.0% |
| 70 | 19 | 1.9% |
| Other values (170) | 749 | 74.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 34.2 | 2 | 0.2% |
| 35 | 5 | 0.5% |
| 37 | 15 | 1.5% |
| 37.48 | 2 | 0.2% |
| 38 | 1 | 0.1% |
| 45 | 1 | 0.1% |
| 46.3 | 21 | 2.1% |
| 47.3 | 24 | 2.4% |
| 52 | 1 | 0.1% |
| 52.8 | 4 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 280 | 1 | 0.1% |
| 270.9 | 1 | 0.1% |
| 254.79 | 2 | 0.2% |
| 241 | 1 | 0.1% |
| 235 | 2 | 0.2% |
| 214.56 | 3 | 0.3% |
| 204 | 1 | 0.1% |
| 197 | 2 | 0.2% |
| 190 | 9 | 0.9% |
| 187.74 | 1 | 0.1% |
seats
Real number (ℝ)
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.403 |
| Minimum | 4 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 9 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.91292082 |
|---|---|
| Coefficient of variation (CV) | 0.16896554 |
| Kurtosis | 1.890972 |
| Mean | 5.403 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.6728153 |
| Sum | 5403 |
| Variance | 0.83342442 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 777 | 77.7% |
| 7 | 161 | 16.1% |
| 4 | 24 | 2.4% |
| 8 | 23 | 2.3% |
| 6 | 8 | 0.8% |
| 9 | 7 | 0.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 24 | 2.4% |
| 5 | 777 | 77.7% |
| 6 | 8 | 0.8% |
| 7 | 161 | 16.1% |
| 8 | 23 | 2.3% |
| 9 | 7 | 0.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 7 | 0.7% |
| 8 | 23 | 2.3% |
| 7 | 161 | 16.1% |
| 6 | 8 | 0.8% |
| 5 | 777 | 77.7% |
| 4 | 24 | 2.4% |
Interactions
Correlations
| | engine | fuel | km_driven | max_power | mileage | owner | seats | seller_type | selling_price | transmission | year |
|---|---|---|---|---|---|---|---|---|---|---|---|
| engine | 1.000 | 0.438 | 0.261 | 0.772 | -0.463 | 0.040 | 0.543 | 0.249 | 0.516 | 0.499 | 0.015 |
| fuel | 0.438 | 1.000 | 0.174 | 0.238 | 0.305 | 0.000 | 0.220 | 0.106 | 0.150 | 0.000 | 0.133 |
| km_driven | 0.261 | 0.174 | 1.000 | 0.048 | -0.202 | 0.164 | 0.236 | 0.142 | -0.328 | 0.243 | -0.597 |
| max_power | 0.772 | 0.238 | 0.048 | 1.000 | -0.348 | 0.067 | 0.341 | 0.255 | 0.666 | 0.582 | 0.211 |
| mileage | -0.463 | 0.305 | -0.202 | -0.348 | 1.000 | 0.080 | -0.445 | 0.090 | -0.019 | 0.221 | 0.316 |
| owner | 0.040 | 0.000 | 0.164 | 0.067 | 0.080 | 1.000 | 0.060 | 0.174 | 0.165 | 0.147 | 0.281 |
| seats | 0.543 | 0.220 | 0.236 | 0.341 | -0.445 | 0.060 | 1.000 | 0.025 | 0.297 | 0.034 | 0.027 |
| seller_type | 0.249 | 0.106 | 0.142 | 0.255 | 0.090 | 0.174 | 0.025 | 1.000 | 0.364 | 0.362 | 0.196 |
| selling_price | 0.516 | 0.150 | -0.328 | 0.666 | -0.019 | 0.165 | 0.297 | 0.364 | 1.000 | 0.628 | 0.710 |
| transmission | 0.499 | 0.000 | 0.243 | 0.582 | 0.221 | 0.147 | 0.034 | 0.362 | 0.628 | 1.000 | 0.308 |
| year | 0.015 | 0.133 | -0.597 | 0.211 | 0.316 | 0.281 | 0.027 | 0.196 | 0.710 | 0.308 | 1.000 |
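The "High correlation" alerts in the overview are consistent with flagging every pair whose absolute coefficient in this matrix is at least 0.5. The sketch below applies that rule to the matrix as printed; the 0.5 cut-off is an assumption inferred from the numbers, and the helper name is hypothetical.

```python
THRESHOLD = 0.5  # assumed cut-off, inferred from the alerts; not stated in the report

names = ["engine", "fuel", "km_driven", "max_power", "mileage", "owner",
         "seats", "seller_type", "selling_price", "transmission", "year"]

# Correlation matrix copied row by row from the table above.
matrix = [
    [1.000, 0.438, 0.261, 0.772, -0.463, 0.040, 0.543, 0.249, 0.516, 0.499, 0.015],
    [0.438, 1.000, 0.174, 0.238, 0.305, 0.000, 0.220, 0.106, 0.150, 0.000, 0.133],
    [0.261, 0.174, 1.000, 0.048, -0.202, 0.164, 0.236, 0.142, -0.328, 0.243, -0.597],
    [0.772, 0.238, 0.048, 1.000, -0.348, 0.067, 0.341, 0.255, 0.666, 0.582, 0.211],
    [-0.463, 0.305, -0.202, -0.348, 1.000, 0.080, -0.445, 0.090, -0.019, 0.221, 0.316],
    [0.040, 0.000, 0.164, 0.067, 0.080, 1.000, 0.060, 0.174, 0.165, 0.147, 0.281],
    [0.543, 0.220, 0.236, 0.341, -0.445, 0.060, 1.000, 0.025, 0.297, 0.034, 0.027],
    [0.249, 0.106, 0.142, 0.255, 0.090, 0.174, 0.025, 1.000, 0.364, 0.362, 0.196],
    [0.516, 0.150, -0.328, 0.666, -0.019, 0.165, 0.297, 0.364, 1.000, 0.628, 0.710],
    [0.499, 0.000, 0.243, 0.582, 0.221, 0.147, 0.034, 0.362, 0.628, 1.000, 0.308],
    [0.015, 0.133, -0.597, 0.211, 0.316, 0.281, 0.027, 0.196, 0.710, 0.308, 1.000],
]

def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    flagged = {}
    for i, row_name in enumerate(names):
        partners = [names[j] for j, value in enumerate(matrix[i])
                    if j != i and abs(value) >= threshold]
        if partners:
            flagged[row_name] = partners
    return flagged

flagged = high_correlations(names, matrix, THRESHOLD)
for variable, partners in flagged.items():
    print(f"{variable}: {', '.join(partners)}")
```

Under this rule, engine and transmission (0.499) fall just short of the cut-off, which matches the alert list naming only max_power and one other field for transmission.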
Missing values
Sample
First rows
| | name | year | selling_price | km_driven | fuel | seller_type | transmission | owner | mileage | engine | max_power | seats |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Mahindra Xylo E4 BS IV | 2010 | 229999 | 168000 | Diesel | Individual | Manual | First Owner | 14.00 | 2498 | 112.0 | 7 |
| 1 | Tata Nexon 1.5 Revotorq XE | 2017 | 665000 | 25000 | Diesel | Individual | Manual | First Owner | 21.50 | 1497 | 108.5 | 5 |
| 2 | Honda Civic 1.8 S AT | 2007 | 175000 | 218463 | Petrol | Individual | Automatic | First Owner | 12.90 | 1799 | 130.0 | 5 |
| 3 | Honda City i DTEC VX | 2015 | 635000 | 173000 | Diesel | Individual | Manual | First Owner | 25.10 | 1498 | 98.6 | 5 |
| 4 | Tata Indica Vista Aura 1.2 Safire BSIV | 2011 | 130000 | 70000 | Petrol | Individual | Manual | Second Owner | 16.50 | 1172 | 65.0 | 5 |
| 5 | Mahindra Thar CRDe | 2019 | 975000 | 12584 | Diesel | Dealer | Manual | First Owner | 16.55 | 2498 | 105.0 | 6 |
| 6 | Chevrolet Spark 1.0 LS | 2011 | 150000 | 35000 | Petrol | Individual | Manual | First Owner | 18.00 | 995 | 62.0 | 5 |
| 7 | Maruti Ritz ZXi | 2012 | 275000 | 70000 | Petrol | Individual | Manual | Second Owner | 18.50 | 1197 | 85.8 | 5 |
| 8 | Maruti Alto LX | 2011 | 140000 | 72000 | Petrol | Individual | Manual | Second Owner | 19.70 | 796 | 46.3 | 5 |
| 9 | Hyundai Creta 1.6 CRDi SX | 2016 | 850000 | 58000 | Diesel | Individual | Manual | First Owner | 19.67 | 1582 | 126.2 | 5 |
Last rows
| | name | year | selling_price | km_driven | fuel | seller_type | transmission | owner | mileage | engine | max_power | seats |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 990 | Maruti Alto LXi | 2007 | 95000 | 70000 | Petrol | Individual | Manual | Second Owner | 19.70 | 796 | 46.30 | 5 |
| 991 | Honda Brio V MT | 2012 | 376000 | 26000 | Petrol | Individual | Manual | First Owner | 19.40 | 1198 | 86.80 | 5 |
| 992 | Maruti Alto LXi | 2006 | 85000 | 150000 | Petrol | Individual | Manual | Second Owner | 19.70 | 796 | 46.30 | 5 |
| 993 | Maruti 800 DX | 1999 | 52000 | 100000 | Petrol | Individual | Manual | First Owner | 16.10 | 796 | 37.00 | 4 |
| 994 | Maruti Swift Dzire VXi | 2010 | 240000 | 143000 | Petrol | Individual | Manual | First Owner | 17.50 | 1298 | 85.80 | 5 |
| 995 | Hyundai i10 Magna 1.1L | 2008 | 250000 | 100000 | Petrol | Individual | Manual | Second Owner | 19.81 | 1086 | 68.05 | 5 |
| 996 | Hyundai i20 2015-2017 Sportz 1.2 | 2017 | 440000 | 50000 | Petrol | Individual | Manual | Second Owner | 18.60 | 1197 | 81.83 | 5 |
| 997 | Hyundai i20 Era Diesel | 2009 | 340000 | 40000 | Diesel | Individual | Manual | First Owner | 23.00 | 1396 | 90.00 | 5 |
| 998 | Hyundai i10 Asta | 2012 | 350000 | 25000 | Petrol | Individual | Manual | First Owner | 20.36 | 1197 | 78.90 | 5 |
| 999 | Honda City i DTec SV | 2016 | 700000 | 110000 | Diesel | Individual | Manual | First Owner | 26.00 | 1498 | 98.60 | 5 |
Duplicate rows
Most frequently occurring
| | name | year | selling_price | km_driven | fuel | seller_type | transmission | owner | mileage | engine | max_power | seats | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | Honda Jazz VX | 2016 | 550000 | 56494 | Petrol | Trustmark Dealer | Manual | First Owner | 18.20 | 1199 | 88.70 | 5 | 8 |
| 9 | Jaguar XF 2.0 Diesel Portfolio | 2017 | 3200000 | 45000 | Diesel | Dealer | Automatic | First Owner | 19.33 | 1999 | 177.00 | 5 | 6 |
| 28 | Toyota Camry 2.5 Hybrid | 2016 | 2000000 | 68089 | Petrol | Trustmark Dealer | Automatic | First Owner | 19.16 | 2494 | 157.70 | 5 | 6 |
| 31 | Volvo V40 D3 R-Design | 2018 | 2475000 | 2000 | Diesel | Dealer | Automatic | First Owner | 16.80 | 1984 | 150.00 | 5 | 6 |
| 1 | BMW X4 M Sport X xDrive20d | 2019 | 5500000 | 8500 | Diesel | Dealer | Automatic | First Owner | 16.78 | 1995 | 190.00 | 5 | 4 |
| 17 | Maruti Swift AMT VVT VXI | 2019 | 650000 | 5621 | Petrol | Trustmark Dealer | Automatic | First Owner | 22.00 | 1197 | 81.80 | 5 | 4 |
| 23 | Skoda Rapid 1.6 MPI AT Elegance | 2016 | 645000 | 11000 | Petrol | Dealer | Automatic | First Owner | 14.30 | 1598 | 103.50 | 5 | 4 |
| 25 | Tata Safari Storme EX | 2015 | 503000 | 110000 | Diesel | Individual | Manual | First Owner | 14.10 | 2179 | 147.94 | 7 | 4 |
| 4 | Hyundai Grand i10 1.2 CRDi Sportz | 2017 | 450000 | 56290 | Diesel | Dealer | Manual | First Owner | 24.00 | 1186 | 73.97 | 5 | 3 |
| 10 | Lexus ES 300h | 2019 | 5150000 | 20000 | Petrol | Dealer | Automatic | First Owner | 22.37 | 2487 | 214.56 | 5 | 3 |
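A table like this can be produced by counting how often each full row occurs and keeping the rows seen more than once. A minimal sketch with the standard library; the toy rows are hypothetical and not taken from the dataset, and this is not necessarily how the profiling tool computes its figures.

```python
from collections import Counter

# Hypothetical toy rows of (name, year, selling_price).
rows = [
    ("Car A", 2016, 550000),
    ("Car B", 2017, 3200000),
    ("Car A", 2016, 550000),
    ("Car C", 2019, 650000),
    ("Car A", 2016, 550000),
    ("Car B", 2017, 3200000),
]

# Count identical rows, then keep only those that occur more than once.
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

# List the most frequently repeated rows first.
for row, n in sorted(duplicates.items(), key=lambda item: -item[1]):
    print(row, n)
```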